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The third enactment of Cambridge 
Healthtech Institute's Macroresults 
through Microarroys meeting was held 
in Boston (MA, USA) from 29 April- 
1 May 2002. The subtheme of this year's 
meeting was 'advancing drug discov- 
ery', a widely touted application for 
array technology. 

The evolution of microarrays 

If you were asked 'Who first conceived 
of the idea of microarrays', who would 
come to mind? Mark Schena perhaps, 
first author of the seminal ! 995 paper 
on cDNA arrays [1]? Maybe Pat Brown, 
Schena's then supervisor? Or perhaps 
Stephen Fodor, the primary driver 
behind Affymetrix's ' (http://www. 
affymetrix.com) oligonucleotide-based 
platform [2]. Brits might even chant the 
name of Ed Southern [3]. Well, accord- 
ing to Roger Ekins (University College 
London Medical School; http://www. 
ucl.ac.uk/medicine/) all these answers 
would be wrong. It was in fact Ekins 
and his colleagues who first conceived 
of and patented 'a new generation of 
ultrasensitive, miniaturized assays for 
protein and DNA-RNA measurement 
based on the use of microarrays' in the 
mid 1980s [4], The concept and poten- 
tial of array technology was more fully 
described in a later publication, in 
which Ekins et al. [5] concluded that an- 
tibody microspots of -50 urn' could be 
achieved, and that as many as 2 million 
different immunoassays could, in prin- 
ciple, be accommodated on a surface 
area of 1 cm 2 . 

Technological innovation 

In practice, it took a different biological 

molecule (DNA), a different research 



group, and a leap into microfabri- 
cation technology to even begin 
approaching these kinds of densities 
[Affymetrix patent 6045996 talks of 
one million spots cm- 2 ]. Of course, 
advancing technology is one of the 
driving engines behind the genomics 
juggernaut, and we are already seeing 
'4th generation' machines for fab- 
ricating DNA chips. If the company 
representatives at this meeting are to 
be believed (and their cases seemed 
strong), spotting is out, and in situ 
fabrication of oligonucleotide-based 
'iterative custom arrays' is in. Whether 
you go with the Corhbimatrix's (http:// 
www.combimatrix.com) electrochemi- 
cally directed synthesis and detection 
system, febit's (http://www.febit.com) 
Geniom® technology, or Nimblegen's 
(http://www.nimblegen.com) Maskless 
Array Synthesizer technology is a 
matter of personal choice. However, 
each of these machines provides the 
flexibility to design variable length 
oligonucleotide probes from se- 
quences inputted by the user, and then 
perform in situ synthesis of an array. 
Each system also boasts unique advan- 
tages. For example, Combimatrix's 
biological array processor is a semi- 
conductor coated with a 3D layer 
of porous material in which DNA, 
RNA, peptides or small molecules 
can be synthesized or immobilized 
within discrete test sites, while febit's 
Ceniom One® is a fully integrated 
gene-expression analysis system with 
minimal user hands-on time - the 
probe sequences are programmed, the 
RNA samples inserted, and the gene 
expression data is pumped out a few 
hours later. 



Cell- and tissue-based arrays 
Array technology is in most people's 
minds firmly linked with gene-expression 
profiling. Fewer are aware that cell- and 
tissue-based arrays have been devel- 
oped, and how they can provide 
a vital extra dimension to research. In 
support of this, Barry Bochner gave an 
update on the cell-based array system 
that Biolog (http://www.biolog.com) 
has produced for simultaneously mea- 
suring the effects of one gene in the cell 
under thousands of growth conditions 
(see [6] for further details). David Walt 
(Tufts University; http://www. tufts, 
edu/) is developing single live cell ar- 
rays using optical imaging fiber (OIF) 
technology. An array of microwells is 
fabricated on the face of an OIF at den- 
sities of up to 10 million wells cm- 2 . 
Cells are then added to the wells and 
disperse at an average of one cell per 
well. Physiological and genetic re- 
sponses of each cell are measured via 
fluorescence produced by reporter 
genes (e.g. /ocZ, gfp. Assays performed 
so far include yeast live or dead cell 
assay, microenvironment pH and 
0 2 measurements, promotor responses 
using the locZ and phoA reporter genes, 
and protein-protein interactions using 
the yeast two-hybrid system. The main 
advantage of this system is that the cells 
remain alive during the assay, which 
means a real-time timecourse can be 
performed and/or the array passed 
from sample to sample. This would be 
useful in, for example, the scanning of 
a combinatorial drug library for specific 
physiological effects. 

Tissue arrays are a useful complemen- 
tary technology to DNA arrays because 
they can be used to help validate and 
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understand the biological and medical 
significance of gene changes discov- 
ered using standard DNA arrays. For 
example, an array of tumor tissues can 
be screened for the protein (using im- 
munohistochemistry), message (using 
in situ hybridization) and copy number 
(using comparative genomic hybridiza- 
tion) of a gene of interest, to determine 
if expression of the gene (or lack 
thereof) is related in any way to sur- 
vival. They can also be used to predict 
the probability of clinical failure of lead 
compounds as a result of toxicity by 
evaluating the distribution of the drug 
targets in normal tissue. Spyro Mousses 
and his co-workers at the National 
Human Genome Research Institute 
(http://www.nhgri.nih.gov/index.html) 
have built such arrays, including a 
multi-tumor array (-5000 specimens, 
and sections from 36 normal and 800 
metastatic tissues) and a normal tissue 
array (76 tissue and 332 cell types). 

The problem with proteins 

It has been said that genomics tells us 
what might happen, transcriptomics 
indicates what should happen, and pro- 
teomics shows what is happening. The 
impact of functional proteomics on 
pharmaceutical R&D is rapidly increas- 
ing, and protein arrays are being used 
increasingly in both basic and applied 
research. Their use lies not only in com- 
parative protein expression and inter- 
action profiling, but also in diagnostics 
and drug discovery. However, an in- 
creasing number of researchers have 
found that protein arrays, like their 
cousins the DNA arrays, present several 
practical obstacles relating to their pro- 
duction and use. For example, in using 
Escherichia coli to produce recombi- 
nant eukaryotic proteins from a single 
expression vector, multiple protein 
products are often produced, suggest- 
ing mixes of truncated or otherwise 
altered proteins. There is also the obvi- 
ous concern that the proteins might 
not be modified in a similar manner to 



eukaryotic systems. Also, an optimal 
method for depositing and binding 
proteins to the selected substrate is 
yet to be determined, as is the best 
way to ensure that they are bound in a 
correctly folded, active conformation. 

Several companies have been address- 
ing these problems. Prolinx (http:// 
www.prolinxinc.com) is one such com- 
pany, and Karin Hughes described their 
Versalinx™ chemistry for producing 
protein, peptide and small-molecule 
arrays. Versalinx™ uses solution-phase 
conjugation followed by immobiliza- 
tion, resulting in functional orientation 
of proteins and peptides on the sub- 
strate surface. It also offers the valuable 
additional benefit of exhibiting low 
non-specific binding. Sense Proteomic 
(http://www.senseproteomic.com) is 
also among those addressing these 
problems to develop robust protein 
arrays for drug discovery and clinical 
applications and has developed func- 
tional protein array formats based on 
specific disease tissues. Subtractive hy- 
bridization is used to identify genes 
with altered expression in breast tumor 
and cystic fibrosis compared to normal 
tissue. A high throughput cloning strat- 
egy (COVET™) is then used to produce 
libraries of genes that are tagged, 
cloned, expressed, purified and finally 
immobilized on glass slides. Initial vali- 
dation studies have shown that the vast 
majority of the immobilized proteins do 
indeed retain biological function. 

Stefan Schmidt and his company 
(CPC Biotech; http://www.gpcbiotech. 
de) have moved past the platform devel- 
opment stage and, with their focus 
firmly on drug discovery, are currently 
developing kinase-profiling arrays. 
Kinases are important targets for phar- 
maceutical drug discovery and therapy, 
and GPC's aim is to simultaneously de- 
tect multiple kinases, obtain activity pro- 
files for different cell types, or analyze 
the ability of drug candidates to inhibit 
kinase activity. To do this, recombinant 
kinase substrates are immobilized on 



membranes, incubated with purified 
kinase, and the substrates measured for 
the degree of phosphorylation. 

Summary 

Meetings like this, packed with exciting 
discoveries and intriguing and interest- 
ing innovation, heavily emphasize the 
pace at which biotechnology is advanc- 
ing, to the extent that the number of 
options for genomic and proteomic re- 
searchers can become overwhelming. 
Although data analysis is perhaps the 
greatest current concern for array users, 
an increasing challenge will be to deter- 
mine the approaches and technology 
that really work, and to do it in a timely 
manner. 
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A standard two-dimensional (2-D) protein map of Fischer 344 rat liver 
(F344MST3) is presented, with a tabular listing of more than 1200 protein species. 
Sodium dodecyl sulfate (SDS) molecular mass and isoelectric point have been es- 
tablished, based on positions of numerous internal standards. This map has been 
used to connect and compare hundreds of 2-D gels of rat liver samples from a va- 
riety of studies, and forms the nucleus of an expanding database describing rat 
liver proteins and their regulation by various drugs and toxic agents. An example 
of such a study, involving regulation of cholesterol synthesis by cholesterol-lower- 
ing drugs and* a high-cholesterol diet, is presented. Since the map has been ob- 
tained with a widely used and highly reproducible 2-D gel system (the Iso-Dalt* 
system), it can be directly related to an expanding body of work in other laborato- 
ries. 
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1 Introduction 

High-resolution two-dimensional electrophoresis of pro- 
teins, introduced in 1975 by O'Farrell and others [1—4], has 
been used over the ensuing 16 years to examine a wide va- 
riety of biological systems, the results appearing in more 
than 5000 published papers. With the advent of computer- 
ized systems for analyzing two-dimensional (2-D) gel ima- 
ges and constructing spot databases, it is also possible to 
plan and assemble integrated bodies of information de- 
scribing the appearance and regulation of thousands of pro- 
tein gene products [5, 6]. Creating such databases involves 
amassing and organizing quantitative data from thousands 
of 2-D gels, and requires a substantial commitment in tech- 
nology and resources. 

Given the long-term effort required to develop a protein da- 
tabase, the choice of a biological system takes on consider- 
able importance. While in vitro systems are ideal for answer- 
ing many experimental questions, especially in cancer re- 
search and genetics, our experience with cell cultures and 
tissue samples suggests that some in vivo approaches could 
have major advantages. In particular, we have noticed that 
liver tissue samples from rats and mice appear.to show grea- 
ter quantitative reproducibility (in terms of individual pro- 
tein expression) than replicate cell cultures. This is perhaps 
a natural result of the homeostasis maintained in a com- 
plete animal vs. the well-known variability of cell cultures, 
the latter due principally to differences in reagents (e.g., 
fetal bovine serum), conditions (e.g.,pH) and genetic "evo- 
lution" of cell lines while in culture. It is also more difficult 
to generate adequate amounts of protein from cell culture 
systems (particularly with attached cells), forcing the inves- 
tigator to resort to radioisotope-based or silver-based stain- 
detection methods. While these methods are more sensi- 
tive (sometimes much more sensitive) than the Coomassie 
Brilliant Blue (CBB) stain typically used for protein detec- 
tion in "large" protein samples, they are generally more vari- 
able, more labor-intensive and, in the case of radiographic 
methods, may generate highly "noisy" images, due to the 
properties of the films used. By contrast, large protein sam- 
ples can easily be prepared from liver using urea/Nonidet 
P-40 (NP-40) solubilization and stained with CBB, which 
has the advantage of being easily reproducible [8]. Finally, 
there remains the question of the "truthfulness" of many in 
viiro systems as compared to their in vivo analogs; how 
great are the changes caused by the introduction into a cul- 
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has been made in the development of mouse, rat and hu- 
man hepatocvte culture systems, as well as in precision-cut 
Ssue slices/Using such an array of techniques, it is posst- 
ble to a«emble a matrix of mammalian systems including 
mouse and rat in vivo on one level and mouse, rat and hu- 
man in vitro on a second level, and to compare effects be- 
tween species and between systems. This approach allows 
u< to draw informed conclusions regarding the biochemical 
-universality" of biological responses among the mammals, 
and to offer some insight into the validity of to J/" ap- 
proaches for toxicologic^ screening. We believe this data 
will be necessary if in vitro alternatives are to achieve wide 
usage in government-mandated safety testing of drugs, con- 
sumer products and industrial and agricultural chemicals. 

A number of interesting studies have been published using 
2-D mapping to examine effects in the rodent liver. A num- 
ber of investigarors have made use of the technique to 
sc" en for existing genetic variants 18-11] or induced muta- 
tions [12-14],mainly in the mouse.This work builds on the 
wealth of genetic information available on the mouse and 
its established position as a mammalian mutatron-detec- 
on system. While some studies of chemical effects have 
been undertaken in the mouse [15-17], most have used he 
rat 118-231 The examination of the cytochrome p-450 sys- 
tem, in particular, has been carried out almost exclusively 
on the rat [24, 25]. 

These considerations lead us to conclude that rodent liver 
offers the best opportunity to systemat.caUy examine an 
anay of gene regulation systems, and ultimately to bu, d la 
predictive model of large-scale mammalian gene contro . 
The basic underlying foundation of such a project is a reli- 
able reproducible master 2-D pattern of liver, to which on- 
going experimental results can be referred.In this paper, we 
report such a master pattern for the acdic and neutral pro- 
ins o rat liver (pattern F344MST3).ln future,th.s master 
will be supplemented by maps of basic proteins, and analog- 
ous maps of mouse and human liver. 



„,„o„c the liver is an obvious choice for the study 
ifTmSl eff«« be«u S e of its well-known plasticity and 
«i!nnsiveness The brain appears to be quite plastic (e* 
Z it is i complicated mixture of cell types requiring 
? k i?uTdi ection Xmost experiments. The W»ney -"He 
oui e Responsive, also presents a potentially confounding 
Sure of cell types. The liver, by contrast, is made up of 
^nmfnant cell type which is easy to solubilize: the 
X cyU e esentingmore than 95% of its mass. Most 
Srtantly the liver performs many homeostatic func- 
ZsSreQuire rapid modulation of gene expression.lt 
anneai tot mSi chemical agents tested affect gene ex- 
nKn in th™ liver at some dosage (N. Leigh Anderson, 
^SSAtidbltr^om), an interesting contrast to our 
earto iSrk wS lymphocytes, for example, which seem to 
mnrh?«s re ionsive. Such results conform to the expec- 
Art eel s P wWi a homeostatic, physiological role 
SiSd be more Plastic than cells differentiated for a pur- 
po^e deperSem Sn «be action of a limited number of spe- 
cific genes. 

The liver also allows the parallels between in vitro and in 
?vo systems to be examined in detail. Significant progress 



2 Materials and methods 
2.1 Sample preparation 

Liver is an ideal sample material for most biochemical stud- 
io including 2-D analysis. A sample is taken of approxima- 
te y 05 g of "issue from the apical end of the left lobe of the 
Wer.'solubilization is effected as rapidly as practical; a 
delay of 5-15 min appears to cause no major alteration i n 
liver protein composition if the liver pieces are kept cold 
eT on ice) in the interim. In the solubilization process, 
the Hver sample is weighed, placed in a glass homogenizer 
(ig , 15 mL Wheaton); 8 volumes of solubitong solution* 
• Thlsolu bilizing solution is composed of 2%NP^0 (Sigma),9 m urea 
Una y t eal grade. *.|.. BDH or Bio-Rad), 0.5% dith.oihre.iol (DTT 
s'llSS Sd* * cmrrifr unpholyte. <pH 9-» 1 1-KBt tt>e.e come » • »J 
s toTk solu.ion.so 2 % r.nal concentration is ach.eved by making he f na 
olution 10%9-U Adeline byvolume) f A^ 
(several hundred mU is made and store V'°*f" ' e ^f ' ^ 
sufficient to provide enough Tor one day's esumated sample prepare 
ion requirement. The so.ution is never allowed to become : w a rm " 
than room temperature at any stage dur.ng preparat.on or 
use s n« heating of concentra.ed urea solutions can produce coot m • 
mm. thai covalently modify proteins producing amfactual charge 
shifts Once thawed, any unused solub.hier » d.scarded. 
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is added (i.e., 4 mL per 0.5 g tissue) and the mixture is ho- 
mogenized using first the loose- and then then the tight-fil- 
ling glass pestle. This takes approximately 5 strokes with 
each pestle and is cairied out at room temperature because 
urea would crystallize out in the cold. Once the liver sample 
i< thoroughly homogenized in the solubilizer, it is assumed 
that all the proteins are denatured (by the chaotropic effect 
of the urea and NP-40 detergent) and the enzymes inacti- 
vated by the high pH (-9.5). Therefore these samples may 
be kept at room temperature until they can be centrifuged 
or frozen as a group (within several hours of preparation). 
The samples are centrifuged for 6 X 10 4 g min (e.g., 500 000 
X g for 12 min using a.Beckman TL-100 centrifuge). The 
centrifuge rotor is maintained at just below room tempera- 
ture (e.g., 15-20°C), but not too cold, so as to prevent the 
precipitation of urea.The centrifuge of choice is a Beckman 
TL-100 because of the sample tube sizes available, but any 
ultracentrifuge accepting smallish tubes will suffice. When 
an appropriate centrifuge is not available near the site of 
sample preparation, samples can be frozen at -S0°C and 
thawed prior to centrifugation and collection of superna- 
lants. Each supernatant is carefully removed following cen- 
trifugation and aliquoted into at least A clean lubes for stor- 
age This is done by transferring all the supernatant to one 
clean tube, mixing this gently (to assure homogeneous 
composition) and then dividing it into A aliquots. The aJi- 
quots are frozen immediately at -80°C These multiple ali- 
quots can provide insurance against a failed run or a freezer 
breakdown. 

2.2 Two-dirneDsional electrophoresis 

Sample proteins are resolved by 2-D electrophoresis using 
the 20 X 25 cm lso-Dalt* 2-D gel system ([26-29]; pro- 
duced by LSB and by Hoefer Scientific Instruments, San 
Francisco) operating with 20 gels per batch. All first-dimen- 
sional isoelectric focusing (IEF) gels are prepared using the 
same single standardized batch of carrier ampholytes 
(BDH 4-8A in the present case, selected by LSB's batch- 
testing program for rat and mouse database work**). A 10 
uL sample of solubilized liver protein is applied to each gel, 
and the gels are run for 33 000 to 34 500 volt-hours using a 
progressively increasing voltage protocol implemented by 
a programmable high-voltage power supply. An Ange- 
lique" computer-controlled gradient-casting system (pro- 
duced by LSB) is used to prepare second-dimensional sod- 
ium dodecyl sulfate (SDS) polyacrylamide gradient slab 
gels in which the top 5 % of the gel is 1 1 %T acrylamide, and 
the lower 95% of the gel varies linearly from 11% to 18%T. 

This system has recently been modified so as to employ a 
commercially available 30.8 %T acrylamide/A^-methyle- 
nebisacrylamide prepared solution (thus avoiding the han- 
dling of the solid acrylamide monomer) and three addi- 
tional stock solutions: buffer (made from Sigma pre-set 
Tris) persulfate and A^A^AMetramethylethylenedi- 
araine (TEMED). Each, gel is identified by a computer- 
printed filter paper label polymerized into the lower left cor- 
ner of the gel. First-dimensional IEF tube gels are loaded 

•• This material (succeeding certified batches ofwhich are available from 
Hoefer Scientific Instruments) has the most linear pH gradient pro- 
duced by any ampholyie tested except for the Pharmacia wide range 
(which has an unacceptable tendency to bind high-molecular weight 
acidic proteins, causing ihem to streak). 



directly (as extruded) onto the slab gels without equilibra- 
tion, and held in place by polyester fabric wedges (Wed- 
gies", produced by LSB) to avoid the use of hot agarose. 
Second-dimensional slab gels are run overnight, in groups 
of 20, in cooled DALT tanks (10°C) with buffer circulation. 
AH run parameters, reagent source and lot information, 
and notations of deviation from expected results are ente- 
red by the technician responsible on a detailed, multi-page 
record of the experiment. 

2.3 Staining 

Following SDS-electrophoresis, slab gels are stained for 
protein using a colloidal Coomassie Blue G-250 procedure 
in covered plastic boxes, with 10 gels (totalling approxima- 
tely 1 L of gel) per box. This procedure (based on the work 
ofNeuhofT[30,31]) involves fixation in 1.5 L of 50% etha- 
nol and 2% phosphoric acid for 2 h, three 30 min washes, 
each in 2 L of cold tap water, and transfer to 1.5 L of 34% 
methanol, 17%ammonium sulfate and 2% phosphoric acid 
for 1 h, followed by the addition of a gram of powdered Coo- 
massie Blue G-250 stain. Staining requires approximately 4 
days to reach equilibrium intensity, whereupon gels are 
transferred to cool tap water and their surfaces rinsed to re- 
move any particulate stain prior to scanning. Gels may be 
kept for several months in water with added sodium azide. 
The water washes remove ethanol that would dissolve the 
stain (and render the system noncolloidal, with high back- 
grounds). The concentrated ammonium sulfate and meth- 
anol solution is diluted by equilibration with the water vol- 
ume of the gels to automatically achieve the correct final 
concentrations for colloidal staining. Practical advantages 
of this staining approach can be summarized as follows: (i) 
the low, flat background makes computer evaluation of 
small spots (max OD < 0.02) possible, especially when 
using laser densitometry; (ii) up to 1500 spots can be reli- 
ably detected on many gels {e.g., rat liver) at loadings low ■ 
enough to preserve excellent resolution; and (iii) reprodu- 
cibility appears to be very good: at least several hundred 
spots have coefficients of reproducibility less than 15%. 
This value is at least as good as previous CBB methods, and 
significantly better than many silver slain systems. 

2.4 Positional standardization 

The carbamylated rabbit muscle creatine phosphokinase 
(CPK) standards [32] are purchased from Pharmacia and 
BDH. Amino acid compositions, and numbers of residues 
present in proteins used for internal standardization, are 
taken from the Protein Identification Resource (PIR) se- 
quence database [33]. 

2.5 Computer analysis 

Stained slab gels are digitized in red light at 134 micron re- 
solution, using either a Molecular Dynamics laser scanner 
(with pixel sampling) or an Eikonix 78/99 CCD scanner. 
Raw digitized gel images are archived on high-density DAT 
tape (or equivalent storage media) and a greyscale video- 
print prepared from the raw digital image as hard-copy 
backup of the gel image. Gels are processed using the Kep- 
ler* software system (produced by LSB), a commercially 
available workstation-based software package built on 
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come of the principles of the eaiherTYCHO system [,4- 
41]. Procedure PROC00S is used to yield b spoihst gmng 
Dosition <h£ P e and density information for each delected 
'pTSs ro'cedure makes use of digital niienng, mathe- 
matical morphology techniques and o.gitaJ maskmg to re- 
move the Sk^ ground, and uses full 2-D least-squaies opti- 
^f, a ,ion to reffne the parameters of a 2-D Gaussian shape 
f iVnn *oce«ing parameters and file locations are 

Tig operation of the automatic analys.s software are ar- 
chived with the reduced oata.The computed resolution and 
level of Gaussian convergence of each gel are inspected 
and archived for quality control purposes. 

r --,»r,t nackaees are constructed using the Kepler ex- 

<° — b,e r ups of , 2 - D 

L corresponding to the experimental groups (e.«., 
ueSand control animals). Each 2-D pattern is matched 
o the appropriate "master" 2-D pattern (pattern 
F344MST3 in the case of Fischer 344 rat liver), thereby 
nroviding linkage to the existing rodent protein 2-D data- 
bases The software allows experiments containing hun- 
dreds of gels to be constructed and analyzed as a unit, with 
ud to 100 gels displayed on the screen at one time for.com- 
narative purposes and multiple pages to accommodate ex- 
periments of > 3000 gels. For each treatment, proteins 
showing significant quantitative differences vj. appropriate 
controls are selected using group-wise statistical parame- 
ters £* Student's t-test, Kepler* procedure STUDENT). 
Protein* sating various quantitative criteria (such as P< 
0 001 difference from appropriate controls) are repre- 
sented a< highlighted spots onscreen or on computer-plot- 
ted protein maps and stored as spot populations {i.e., logi- 
cal vectors) in a liver protein database. Quantitative data 
(, D ot parameters, statistical or other computed values) are 
stored as real-valued vectors in the database. Analysis of co- 
pulation is performed using a Pierson product-moment 
correlation (Kepler procedure CORREL) to determine 
whether groups of proteins are coordinated regulated by 
anv of the treatments. Such groups can be presented graphi- 
callvon a protein map,and reported together with the statis- 
tical criteria used to assess the level of coregulation. Multi- 
variate statistical analysis (e.g., principal components' ana- 
lysis) is performed on data exported to SAS (SAS Institute). 



2.6 Graphical daU output 

Graphical lesults are prepared in GKS and translated 
within Kepler* into output for any of a variety of devices. 
Linedrawing output is typically prepared as Postscript and 
printed on an Apple LaserWriter. Detailed maps presented 
here have been generated using an ultra-high-resolution 
Postscript-compatible Linoironic output device. Greyscale 
eraDhics are reproduced from the workstation screen using 
a Seikosha videoprinter. Patterns are shown in the standard 
orientation, with high molecular mass at the top and acidic 
proteins to the left. 



2.7 Experiment LSBC04 

In the study described here 12-week-old Charles River 
male F344 rats were used. Diets were prepared at LSB, 
based on a Purina 5755M Basal Purified Diet. Lovastatin 
and cholestyramine were obtained as prescription pharma- 



ElearophoreiU 1991, 12. 907-930 

ceutic&Js, ground and mixed with the diet at concentrations 
of 0.075% and 1%, respectively. The high cholesterol diet 
was Purina 5801M-A (5% cholesterol plus 1% sodium cho- 
late in the control diet). Animal work was carried out by Mi- 
crobiological Associates (Bethesda,MD). Animals were ac- 
climatized for one week on the control diet, fed test or con- 
trol diets for one week, and sacrificed on day 8. Average 
daily doses of lovastatin and cholestyramine in appropriate 
groups were 37 rog/kg/day and 5 g/kg/day, respectively, 
based on the weight of the food consumed. Liver samples 
were collected and prepared for 2-D electrophoresis accord- 
ing to the standard liver protocol (homogenization in 8 
volumes of 9 m urea, 2% NP-40, 0.5% ditbiothreitol, 2% 
LKB pH 9-11 carrier ampholytes, followed by centrifuga- 
tion for 30 min at 80 000 X g). Kidney, brain and plasma 
samples were frozen. Gels were run as described above, 
and the data was analyzed using the Kepler* system. Gels 
were scaled, to remove the effect of differences in protein 
loading, by setting the summed abundances of a large num- 
ber of matched spots equal for each gel (linear scaling). 



3 Results and discussion 

3.1 The rat liver protein 2-D map 

F344MST3 is a standard 2-D pattern of rat liver proteins, 
based on the Fischer 344 strain. This pattern was initiated 
from a single 2-D gel and extensively edited in an experi- 
ment comparing it to a range of protein loads, so as to in- 
clude both small spots and well-resolved representations of 
high-abundance spots. More than 700 rat liver 2-D patterns 
have been matched to F344MST3 in a series of drug effects 
and protein characterization experiments, and numerous 
new spots (induced by specific drugs, for instance) have 
been added as a result. A modified version including addi- 
tional spots present in the Sprague-Dawley outbred rat has 
also been developed (data not shown). Figure 1 shows a 
greyscale representation and Fig. 2 a schematic plot of the 
master pattern. More than 1200 spots are included, most of 
which are visible on typical gels loaded with 10 uL of solubi- 
lized liver protein prepared by the standard method and 
stained with colloidal Coomassie Blue. Master spot num- 
bers (MSN's) have been assigned to all proteins, and ap- 
pear in the following figures, each showing one quadrant of 
the pattern. Figure 3 shows the upper left (acidic, high 
molecular mass) quadrant, Fig. 4 the upper right (basic, 
high molecular mass) quadrant, Fig. 5 the lower left (acidic, 
low molecular mass) quadrant, and Fig. 6 the lower right 
(basic, low molecular mass) quadrant. The quadrants over- 
lap as an aid to moving between them. The gel position (in 
100 micron units), isoelectric point (relative to the CPK in- 
ternal pJ standards) and SDS molecularmass (from the cali- 
bration curve in Fig. 8) are listed for each spot (Table 1). Be- 
cause of the precision of the CPK-p7 values, these parame- 
ters can be used to relate spot locations between gel sys- 
tems more reliably than using p/ measurements expressed 
as pH. A major objective of current studies is the identifica- 
tion of all major spots corresponding to known liver pro- 
teins, as well as rigorous definitions of subcellular orga- 
nelle'contents. Of particular interest to us is the parallel de- 
velopment of identifications in the rat and mouse liver 
maps, allowing detailed comparisons of gene expression ef- 
fects in the two systems. The results of these studies will be 
presented systematically in a later edition of this database, 
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but we include here a useful series of 22 orienting identifi- 
cations as an aid to other users of the rat liver pattern (Table 
2). 

3.2 Carbamylated charge standards, computed p/s and 
molecular mass standardization 

We have previously shown that the use of a system of close- 
lv-spaced internal p/ markers (made by carbamylating a 
basic protein) oilers an accurate and workable solution to 
the problem of assigning positions in the pi dimension [32]. 
The same system, based on 36 protein species made by car- 
bamylating' rabbit muscle CPK, has been used here to as- 
sign'p/'s to most rat liver acidic and neutral proteins. The 
standards were coelectrophoresed with total liver proteins, 
and the standard spots added to a special version of the 
master pattern F344MST3. The gel A-coordinates of all 
liver protein spots lying within the CPK charge train were 
then transformed into CPK p/ positions by interpolation 
between the positions of immediately adjacent standards 
(Table Y) using a Kepler* vector procedure. 

It has proven possible to compute fairly accurate p/ values 
for many proteins from the amino acid composition [42]. 
We have attempted here to test a further elaboration of this 
approach, in which we computed pfs for the CPK standards 
themselves, based on our knowledge of the rabbit muscle 
CPK sequence and the fact that adjacent members of the 
charge train tvpically difTer by blockage of one additional ly- 
sine residue (Table 3). We compared these values to similar 
computed pfs for an additional set of carbamylated stand- 
ards made from human hemoglobin beta chains and a se- 
ries of rat liver and human plasma proteins of known posi- 
tion and sequence (Fig- 7,Table 4).The result demonstrates 
good concordance between these systems. Two proteins 
show significant deviations: liver fatty-acid binding protein 
(FABP; #1 in Table 4) and protein disulphide isomerase 
(#20 in the table). The FABP spot present on F344MST3 
may represent a charge-modified version of a more basic 
parent spot closer to the expected p/, not resolved in the 
1EF/SDS gel. Of particular importance is the fact that, by 
comparing computed pfs of sequenced but unlocated pro- 
teins with the CPK p^s, we can assign a probable gel loca- 
tion without making any assumptions regarding the actual 
gel pH gradient. This offers a useful shortcut, given the va- 
garies of pH measurement on small diameter IEF gels. We 
have used this approach to compute the CPK pPs of all rat 
and mouse proteins in the P1R sequence database, as an aid 
to protein identification (data not shown). 

In order to standardize SDS molecular weight (SDS-MW), 
we have used a standard curve fitted to a series of identified 
proteins (Fig. 8). Rather than using molecular mass per se, 
we have elected to use the number of amino acids in the 
polypeptide chain, as perhaps a better indication of the 
length of the SDS-coated rod that is sieved by the second 
dimension slab. The resulting values were multiplied by 
132 (the weighted average mass oi amino acids in se- 
quenced proteins) to give predicted molecular masses. Be- 
cause we use gradient slabs, we have not constrained the fit- 
ted curve to conform to any predetermined model; rather 
we tried many equations and selected the best using the 
program M Tablecurve , 'on a PC. The equation chosen was^ 
= a + 6x + c/x\ where y is the number of residues, x is the gel 



ycoordinate,o is 51 1.83, b is -0.2731 andcis 33183 801. The 
resulting fit appears to be fairly good over a broad range of 
molecular mass. 

3.3 An example of rat liver gene regulation: Cholesterol 
metabolism 

Experiment LSBC04 was designed as a small-scale test of 
the regulation of cholesterol metabolism in vivo by three 
agents included in the diet: lovastatin (Mevacor*, an inhibi- 
tor of HMG-CoA reductase); cholestyramine (a bile acid 
sequestrant that has the effect of removing cholesterol 
from the gut-liver recirculation); and cholesterol itself. The 
first two agents should lower available cholesterol and the 
third should raise it, allowing manipulation of relevant 
gene expression control systems in both directions. Such 
an experiment offers an interesting lest of the 2-D mapping 
system since most of the pathway enzymes are present in 
low abundance, many are membrane-bound and difficult 
to solubilize,and the pathwayitself is complex. Approxima- 
tely 1000 proteins were separated and detected in liver ho- 
mogenates. Twenty-one proteins were found to be affected 
by at least one treatment, and these could be divided into 
several coTegulated groups. 

3.3.1 MSN 413 (putative evtosolic HMG-CoA synthase) 
and sets of spots regulated coordinately or inversely 

One group of spots (including a spot assigned to the cyto- 
solic HMG-CoA synthase, MSN 413) showed the expected 
increase in abundance with lovastatin or cholestyramine, 
the synergistic further increase with lovastatin and choles- 
tyramine, and a dramatic decrease with the high cholesterol 
diet. Spot number .4 13 is the most strongly regulated pro- 
tein in the present experiment, showing a 5- to 10-fold in- 
duction after a 1 week treatment with 0.075% lovastatin and 
1 % cholestyramine in the diet (Figs. 9 and 10). Its expres- 
sion follows precisely the expectation for an enzyme whose 
abundance is controlled by the cholesterol level; it is pro- 
gressively increased from the control levels by cholestyra- 
mine, lovastatin and lovastatin plus cholestyramine, and it 
sinks below the threshold of detection in animals fed the 
high cholesterol diet. This spot has been tentatively identi- 
fied as the cytosolic HMG-CoA synthase, based on a reac- 
tion with an antiserum to that protein provided by Dr. Mi- 
chael Greenspan at Merck Sharp &Dohme Research Labo- 
ratories. This enzyme lies immediately before HMG-CoA 
reductase in the liver cholesterol biosynthesis pathway, and 
is known to be co-regulated with it. Spot 413 has an SDS 
molecular weight of about 54 000 and a CPK pi of -11.4, in 
reasonably close agreement with a molecular weight of 
57 300 and a CPK pi of -15.7 computed from the known se- 
quence of the hamster enzyme [43]. 

Using a classical product-moment correlation test (Kepler 
procedure CORREL), a series of five additional spots was 
found to be coregulated with 413. The level of correlation 
was exceedingly high (> 95%).Two of these, 1250 and 933, 
are at similar molecular weights and approximately one 
charge more acidic than 413 (Fig. 9), indicating that they 
may be covalently modified forms of the 413 polypeptide. 
This suspicion is strengthened by the observation that both 
spots are also stained by the antibody to cytosolic HMG- 
CoA synthase. The remaining three correlated spots appear 
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,„ , nmriT We 2 n additional related pair (1253 and 1001) of 
oS l^ »d i single spot (1119) of around 28 kDa. 
Because the<e two presumed proteins are present at sub- 

solic HMG-CoA synthase is reported to consist of only one 
vie of polypeptide, they are likely to represent other, very 
Lhtly ^co regulated enzymes. A second group of s« spots 

t, £ ntiy coregu re gulator)' pattern close to the in- 

*" n at for %ot 4 13 (MSN's 34, 79, 178, 182, 204, 347; 
verse of that for -pot inS)tbe lowest leV el of ex- 

data not shown). For the P ^ 
pressron occurs .,th exp e) ^ ^ ^ 

m ,ne and the highe ?? ^ h comlaltd and lie 

lesterol diet. SP°«^ 6 the same roo]ecular weight; they 

'X'Z ^oSSo^ single protein. The other four 

Tpots probably represent additional enzymes or subumts. 

3.3.2 MSN 235 and coregulsted spots 

A third group of five spots, mainly comprised of mitorfion- 
^ . , l,»in« including putative mitochondrial HMG- 
i Ua l SnS snoJ showed a modest induction by lovasta. 
G0A 3 bu liu°e or no effect with any of the other treat- 
men ^including the combination of lovastatin and choles- 
p; D i?i This result is intriguing because lovasta- 
«^ed 2 ^ffect only the regulation of enzymes of 

u i 7*Irni «vnthesis which is entirely extra-mitochon- 
ddaf ^ee of" be spots (235, 134, 144) form a closely- 
nactd triad at approximately 30 kDa, and are l.kely to re- 
present Soforms of one protein. All three ^re stained 
I „ Qn ,;hnfiv to the mitochondrial form of HMG-CoA 
svn ^setb alned Jrom Dr.Greenspan. Subcellular fractio- 

J r«« inrticiites a mitochondrial location. The other two 
n3 ^633 at about 38 kDa and 724 at about 69 kDa) are 
each present at lower abundance than the members of the 
triad. 

3.33 An example of an anti-synergistic effect 

a «Tth"si>ot (367) shows strong induction by lovastatin 
n i .„ threefold! and about half as much induction with 
choicsmarnine.but without sharing the ani- 
1 f?n Si helerbgeneity pattern of the 235-set (Fig. 13). 
StSSnS 22 mitochondria], and represents the clear- 
S eJarnS of na^ti-synergistic effect of lovastatin and 
hn^mamine The existence of such an effect demon- 
fra tes ^^SaUovastatin and cholestyramine do not act exclu- 
sively through the same regulatory pathway. 

3.3.4 Complexity of the cholesterol synthesis pathway 

Taken together, these results suggest that treatment with lo- 
, linne can affect both cytosolic and mitochondrial 
vastatin f one ca "^. C oA, while cholestyramine, on the 
£ „ ir alone or in combination with lovastatin, 
«5nr« » strone effect on the putative cytosolic pathway, 
Z Ht S ^ r no effect on the putative mitochondrial path- 
way An explanation for this difference may he in lovasta- 
S effect on levels of HMG-CoA and related precursor 
tin s effect on i te e d bet ween the cyiosol and 

C r P 7 ,hond on whereas cholestyramine should affect 
onhtec^ 

oUnd bSac d uvels.lt remains to be explained why some 
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protein' of the putative mitochondrial pathway are so 
much more variable in their expression in all groups. An ex- 
amination of all the coregulated groups suggests that quan- 
titative statistical techniques can extract a wealth of inter- 
esting information from large sets of reproducible gels.Tne 
abundance of spots in the 413 coregulation group.for exam- 
ple shows an amazing level of concordance in their relative < 
exp'ression among the five individuals of the lovastatin and 
cholestyramine treatment group. This effect is not due to 
differences in total protein loading, since they have already 
been removed by scaling, and since proteins with quite dif- 
ferent regulation patterns can be demonstrated (e.g., Fig. 
13) Such effects raise the possibility that many gene coregu- 
lation sets may be revealed through the study of a suffi- 
ciently large population of control animals (i.e., without 
any experimental manipulation).This approach, exploiting 
natural biological variation in protein expression instead of 
drug effects, offers an important incentive for the construc- 
tion of a large library of control animal patterns. 



4 Conclusions 

Because of the widespread use of rat liverin both basic bio- 
chemistry and in toxicology, there is a long-term need for a 
comprehensive database of liver proteins.The rat liver mas- 
ter pattern presented here has proven to be an accurate re- 
presentation of this system, having been matched to more 
than 700 gels to date. As the number of proteins identified 
and the number of compounds tested for gene expression 
effects grows, we expect this database to contribute valu- 
able insights into gene regulation.lts practical utility in sev- 
eral areas of mechanistic toxicology is already being de- 
monstrated. 

Received September 11, 1991 
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Figure 2. Schematic representation of the master pattern (the same as Fig. 1), useful as an aid in relating specific areas of Fig. \ and the following detailed 
quadrants. 
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Figure 3 Upper left (high molecular weight, acidic) quadrant (#1) of the rat liver map, showing spot numbers. 
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Figure 8. Plot of number of amino acids versus gel X-position, with fitted 
curve used to predict molecular mass of unidentified proteins. 



CPK position 



<4 Figure 7. (a) Plot of computed isoelectric point versus gel A'- position for 
two sets of carbamylated standard proteins (rabbit muscle CPK [+] and 
human hemoglobin P chain, filled diamonds) and several other proteins 
(shaded squares), (b) The identities of the various proteins represented 
by the squares are indicated by the numbers in corresponding positions 
on (a); these refer to Table 4. 
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f/^ure 9. Montage showing effects in the 
region of MSN:413.The montage shows a 
small window into one portion of the 2-D 
pattern, one row of windows for each expe- 
rimental group, and one panel for each gel 
in the experiment. The left-most pattern 
in each row is a group-specific copy of the 
master pattern followed by the patterns 
for the five individual rats in the group. 
The highlighted protein spots (filled circ- 
les) are spot 413 (on the right of each pan- 
el; identified as cylosolic HMG-CoA syn- 
thase) and two modified forms of it (1250 
and 933). From the top, the rows (experi- 
mental groups) are: high cholesterol, con- 
trols, cholestyramine, lovastatin, and lova* 
statin plus cholestyramine. 
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Figure 10, Bargraph showing the quantita- 
tive effects of various treatments on the 
25,000 30.000 abundancejof MSN:413 (cytosolic HMG- 
CoA synthase) in the gels of Fig. 9. . 
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Figure 1L Bargraphs of a series ofsix core- 
gulated spots including MSN:413. In the 
bargraphs. the abundances of the appro- 
priate spot (master spot number shown at 
the top of the panel) in each animal are 
shown. The five five-animal groups are in 
the order (left to right): high cholesterol, 
controls, cholestyramine, lovastatin, and 
lovastatin plus cholestyramine. Each bar 
within a group represents one experimen- 
tal animal liver (one 2-D gel).Note the cor- 
related expression of the 6 spots, espe- 
cially in the two far right (most strongly in- 
duced) groups. 
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Figure 13. Data on spot MSN:367, presented as in Fig. 11. This protein 
shows unambiguously the anti-synergistic effect of lovastatin and choles- 
tyramine (fifth group) as compared to lovastatin (fourth group).This res- 
ponse contrasts strongly with the regulation pattern seen in Fig. 11. 
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predicted molecular mass (from the standard curve of Fig. 8). 
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14,400 


AQ1 


1 QQ1 

1 5JWO 


P1Q 

CIS 


>0.0 


37 800 


osw 


1R9R 


03v 


fi A 

"OJ.V 


45 900 


Osf3 






- lO.O 


107 ocfi 


090 


1 S tLA 


71 * 
/I 9 


r\ ft 


49 7on 


CM 

by / 


^ OQ7 

i yy / 


o>49 


>U.v 


7fl AAft 


698 


OCT 


Cc^l 

OOO 


m a 


' C1 OATI 
31 ,OUU 


699 


1 540 




-4.4; 




702 


577 


900 


•23.8 


34,400 


703 


1 610 


562 


-3.2 


51,900 


7U3 


I iC/O 


. 3/' 


7 Q 


31 ,4MJ 


/Uo 


1 841 




-U./ 


>n ^AA 


707 


1 018 


1386 


-ii .7 


16 900 


709 


1074 


1145 


-10.7 


25/I00 


710 


293 


889 


<-35.0 


34.800 


712 


720 


412 


-18.5 


66.600 


713 


1386 


841 


-6.4 


36,800 


714 


1328 


263 


-7.1 


103.100 


715 


698 


433 


-19.1 


63.900 


716 


701 


481 


-19.0 


58.700 


717 


1875 


699 


-0.5 


43,600 


718 


575 


702 


•23.9 


43,400 


719 


1216 


204 


-8.6 


140.400 


721 


1069 


464 


-10.8 


60.400 



722 


1272 


506 


-7.9 


56,400 


723 


958 


822 


-13.0 


37.700 


724 


763 


395 


-17.3 


69,100 


725 


720 


916 


•18.5 


33.700 


726 


1476 


415 


-4.9 


66,200 


727 


1846 


473 


-0.7 


59.400 


728 


510 


783 


-27.3 


39,400 


729 


1217 


1126 


-8.6 


25,600 


730 


1858 


724 


-0.6 


42,300 


731 


665 


765 


-20.2 


40.300 


733 


1321 


312 


•7.2 


85,900 


734 


719 


427 


-18.5 


64,600 


735 


1101 


473 


•10.2 


59,500 


736 


1359 


569 


-6.7 


51 ,400 


738 


696 


220 


-19.2 


127.600 


739 


687 


409 


-19.5 


67,000 


740 


1205 


256 


-8.7 


106.200 


741 


995 


563 


-12.1 


51 ,900 


742 


898 


596 


-14.1 


49,500 


743 


881 


181 


-14.5 


165,900 


744 


1951 


686 


>0.0 


44,200 


745 


726 


168 


.-18.3 


183,600 


746 




643 


-12.0 


46.600 


748 


182 


1503 


<-35.0 


13,000 


749 


2005 


649 


>0.0 


46,300 


.750 


1448 


575 


-5.4 


51.000 


751 


792 


266 


-16.5 


101 ,900 


752 


469 


296 


-28.9 


90,600 


754 


664 


254 


-20.3 


107.000 


755 


1195 


184 


-8.8 


161,000 


756 


1821 


1113 


-0.9 


26.300 


757 


909 


246 


-13.9 


111.000 


760 


790 


133 


-16.5 


264.900 



926 



L Anderson ctat- 



Electrophoresis 1991, J2, 907-930 



MSN 



CPKDl sdsmw 



MSN 



Y CPKpl SDSMW 



MSN 



Y CPKpl SDSMW 



"7C1 
/Dl 


1399 


733 


'3.2 


41,600 


646 


/bo 


!• 1 v 


1085 


-5.9 


27.300 


649 


764 




569 


>0.0 


51.400 


£50 


765 


651 


47* 


-20.8 


5S.300 


651 


766 


1052 


1 149 


•11.1 


25.000 


652 


767 


1968 


468 




59,900 


655 


766 


1330 


685 


-7 1 


44,300 


656 


769 


1S70 


613 


ft 

Xf.v 


46 500 


657 


770 


857 


617 


-lO.w 


AP OCYS 


658 


771 


1337 


974 


7 ft 


^1 W 

J 1 ,3w 


659 


773 


1576 


502 


•a 7 
-3./ 


v 7rm 


ow 


775 


969 


824 


-1 Z.O 


J 1 


C*J I 


776 


1438 


708 


-D.O 


1ftft 


RA9 


777 


1539 


458 




Cl ftAft ' 
C l ,IAJU 


ow 


778 


£50 


434 


•1 D.l 


CC ,OKAJ 




779 


700 


411 


10 1 

- 1 


cjl bnft 


866 


780 


1052 


1136 


-1 1 .1 




DOO 


784 


1413 


529 


C. ft 
-C.v 




DOS 


785 


1364 


885 


-C. f 




Civ 


786 


1822 


835 


-0.9 


37.100 


C7i 


767 


893 


392 


-14.3 


69,500 


C/£ 


790 


616 


882 


•22.0 


35,100 


C /J 


791 


451 


1429 


-29.8 


15,400 


874 


792 


777 


377 


-16.9 


72,000 


875 


793 


1536 


1543 


•4.2 


11,700 


676 


794 


1461 


807 


-5.1 


38,300 


677 


796 


388 


546 


-33.6 


53.100 


E78 


797 


1126 


212 


-9.8 


133,700 


679 


798 


933 


437 


-13.5 


63,400 


680 


799 


1420 


593 


-5.9 


49,600 


881 


800 


1759 


279 


-1.6 


96,500 


863 


801 


624 


865 


-21.7 


35,800 


884 


802 


898 


547 


-14.2 


53,000 


865 


803 


1775 


1468 


-1.4 


14,200 


686 


804 


573 


196 


-24.0 


146,400 


887 


805 


203 


494 


<-35.0 


57,400 


688 


806 


980 


1039 


-12.5 


29,000 


689 


807 


902 


308 


-14.1 


87,200 


890 


808 


625 


827 


-21.7 


37,500 


891 


809 


1851 


1015 


-0.7 


29,900 


892 


810 


440 


573 


-30.9 


51,100 


B94 


811 


1358 


249 


-6.8 


109,700 


895 


812 


851 


393 


-15.1 


69,400 


696 


B13 


745 


1246 


-17.8 


21,600 


897 


814 


2028 


810 


>0.0 


38,200 


898 


815 


1086 


645 


-10.4 


46,500 


899 


616 


629 


313 


-21 .6 


85,700 


900 


817 


1376 


1177 


-6.5 


24,000 


901 


818 


1771 


790 


-1.4 


39,100 


903 


819 


1045 


263 


-11 .2 


103,100 


904 


820 


984 


362 


-12.4 


74,600 


905 


821 


1712 


279 


-2.2 


y©,/uu 




822 


1266 


205 


-8.1 






823 


1517 


654 


•4.4 




cm 


824 


1442 


449 


c c 




Q1 1 


825 


1240 


513 


C O 

-c.o 




Q1 1 


826 


1309 


1014 


7 A 




914 


827 


2012 


708 


>u.u 


43 100 


916 


828 


937 


1405 






917 


830 


1342 


756 


-7.0 


40 700 


919 


831 


562 


826 




^7 


GOO 


832 


1073 


1039 


1 n 7 

• i u. / 


29 000 


921 


833 


481 


620 




37 800 


923 


834 


501 


CD i 

DO 1 


-27.8 


50,500 


924 


837 


751 


748 


-17^6 


41,100 


925 


838 


635 


833 


-21 .3 


37.200 


926 


839 


1494 


459 


-4.7 


60,900 


927 


840 


1952 


301 


>0.0 


89.300 


928 


841 


1585 


1080 


-3.6 


27.500 


929 


842 


571 


1312 


-24.1 


19,400 


931 


843 


1325 


649 


-7.2 


46,300 


932 


844 


1727 


301 


-2.0 


89.200 


933 


845 


630 


679 


-21.5 


44.600 


934 


846 


2016 


905 


>0.0 


34,200 


936 


847 


673 


1200 


-19.9 


23,200 


937 



1863 
1166 
1535 
1035 
634 
499 
1063 
667 
1448 
706 
1070 
472 
674 
1307 
645 
827 
685 
1807 
1323 
1228 
1904 
556 
1540 
1566 
1198 
1076 
1161 
647 
1756 
1543 
1432 
922 
1103 
1501 
798 
636 
951 
717 
1123 
891 
1245 
1962 
1322 
420 
662 
845 
624 
931 
799 
765 
775 
888 
628 
681 
1544 
1606 
1237 
1442 
1260 
764 
1133 
1123 
829 
1131 
1441 
679 
1487 
1082 
1231 
1609 
810 
965 
947 
865 
1421 



271 

523 
1024 

826 

542 

220 

194 

890 

639 

311 
1066 

347 

480 

499 

867 
1004 

494 

402 

783 
1031 

346 

647 

756 

777 

351 

720 
1111 

757 

594 

278 

690 

689 
414 
607 

1103 
634 
759 
548 
229 
413 
234 
346 
626 
570 
428 
243 
703 

1094 
229 
520 
889 
824 

1303 

1544 
301 
387 
688 
749 
367 

1541 

1123 
380 
242 
316 
874 

. 219 
1191 
775 
816 
670 
900 
520 
462 
843 
1056 



-0.6 
-S.2 
-4.2 
-11.4 
-15.5 
-27.8 
-10.9 
-14.4 
-5.4 
-18.9 
-10.7 
•28.6 
-19.9 
•7.4 
-21 .0 
-15.6 
•19.5 
-1.0 
-7.2 
•8.4 
-0,3 
-24.8 
-4.2 ■ 
-3.8 
•6.6 
-10.6 
•9.3 
-20.9 
•1.6 
-4.1 
-5.7 
-13.7 
-10.1 
-4.6 
-16.3 
-21.3 
-13.1 
-18.6 
-9.8 
-14.3 
-8.2 
>0.0 
-7.2 
-31.4 
-20.3 
-15.3 
-21.7 
-13.5 
-16.3 
-17.2 
-17.0 
•14.4 
•15.6 
-19.7 
-4.1 
-3.3 
-8.3 
-5.5 
-8.0 
-17.3 
-9.7 
-9.8 
•15.6 
-9.7 
-5.5 
-19.7 
-4.8 
•10,5 
-8.4 
-3.3 
•16.0 
-12.8 
•13.2 
•14.8 
6.9 



99,500 
54,900 
29,600 
37.500 
53,400 
127.100 
150,500 
34,800 
46,900 
86,200 
28,000 
77,600 
56,800 
57.000 
34,900 
30.300 
57,400 
68.000 
39,400 
29.300 
77,700 
46.400 
40,700 
39.700 
76.800 
42,500 
26,400 
40,700 
49.700 
97,100 
34,800 
44,100 
66,400 
48,900 
26.600 
47,200 
40,600 
52,900 
121.200 
66,400 
117,800 
77,700 
47.700 
51.300 
64,500 
113,000 
43,400 
27,000 
121.000 
55.200 
34,800 
37.600 
19,700 
11,700 
89.100 
70.400 
44.100 
41,100 
73,700 
11,700 
25,900 
71,500 
113,200 
84,300 
35,400 
128,200 
23.500 
39.800 
38,000 
45,100 
34,400 
.55.100 
60,600 
36,800 
28.400 



939 
941 
942 
943 
944 



1197 
1765 
602 
312 
993 



627 
885 
472 



4.6 
-1.5 
-22.7 



496 <-35.0 
491 -12.1 



945 1300 

946 630 

947 187 

948 1380 

949 1766 

950 1038 



951 
952 
954 
955 
957 
959 
960 
961 
962 
963 
964 
965 
966 
967 
968 
969 
970 

971 

972 

974 

975 

976 

977 

978 

979 

980 

981 

983 

984 

985 

987 

988 

990 

991 

992 

993 
994 
995 
996 
997 

1000 
1001 
1002 
1003 
1006 
1007 
1009 
1010 
1011 
1012 
1013 
1014 
1015 
1016 
1017 
1018 
1020 
1021 
1022 
1023 
1024 
1025 



860 
957 
503 
1938 
1010 
768 
596 
557 
887 
564 
969 
671 
1204 
910 
609 
1285 
822 
976 
403 
279 
644 
1124 
994 
1612 
749 
1064 
1197 
1762 
1344 
1024 
739 
816 
785 
1159 
1090 
1030 
847 
902 
888 
1815 
1205 
617 
968 
970 
1736 
643 
822 
875 
291 
1386 
459 
679 
1818 
1032 
1629 
1311 
1722 
1015 
1574 
781 
1129 
812 
785 
1290 



269 
423 
736 
344 
665 
193 
152 
701 
547 
712 
816 
174 
419 
409 
320 
334 
1155 
255 
798 
154 
1048 
206 
232 
437 
567 



-7.5 
-21.6 
<-35.0 
-6.5 
-1.5 
-11.3 
-14.9 
•13.0 
•27.6 
>0.0 
-11.8 
-17.2 
-23.0 
-24.8 
-14.4 
•24.5 
-12.8 
-20.0 

-8.7 
-13.9 
-22.3 

•7.7 
-15.8 
-12.6 
•32.6 



495 <-35.0 
981 -15.3 



295 
664 
642 
1141 
642 
911 
1508 
317 
1105 
1159 
555 
361 
317 
928 
701 
811 
461 
847 
579 
504 
289 
290 
771 
478 
1184 
487 
279 
644 
745 
541 
661 
1128 
634 
994 
1134 
424 
743 
1219 
484 
83 
317 
446 
739 



-9.8 
-12.1 
-3.2 
-17.7 
-10.8 
-8.6 
-1.6 
-6.9 
-11.5 
-17.9 
•15.9 
-16.7 
-9.3 
-10.4 
•11.5 
-15.2 
-14.1 
-14.4 
-0.9 

-e.7 

•22.0 
12.8 
-12.7 
-1.9 
-21.1 
•15.8 
-14.6 
<-35.0 
-6.4 
-29.4 
-19.7 
-0.9 
•11.4 
-3.0 
-7.4 
-2.0 
•11.7 
-3.7 
•16.8 
-9.7 
•15.9 
•16.7 
-7.7 



37.500 
35.000 
59.600 
57,100 
57,700 
100.300 
65.100 
41,600 
78.200 
45.400 
151.000 
213.000 
43,400 
53,000 " 
42.900 
37,900 
174.900 
65.700 
67,100 
83,900 
80.500 
24,800 
106.600 
38,700 
210,300 
28,700 
138.900 
119,300 
63.400 
51.600 
57,400 
31,200 
91,100 
45.400 
46.700 
25,300 
46.700 
33,900 
12,800 
84700 
26,600 
24,600 
52,400 
74,900 
84,500 
33,300 
43,400 
36.200 
60.700 
36.600 
50,700 
56.500 
93,100 
92.700 
40,000 
58,900 
23.700 
58,100 
96,400 
46,600 
41,200 
53,500 
45,600 
25.800 
47.200 
30.700 
25,500 
65.000 
41,300 
22.500 
58.400 
591.300 
84.600 
62.400 
41,500 



Electrophoresis 199U «, 907-930 



Y CPKDl SDSMW 



MSN 


X 


Y 


CPKpl 


SDSMW 


1 1 

1 1 


921 


1 1 to 

1 1 90 




24 700 


1 1 *>A 

1 1 


1 664 


tci 


CD 


35 900 


1 161 


637 


400 


-91 ^ 


68 400 


1 l£9 
1 ID* 




09/ 


91 A 




1 1 DO 




/ 




DC, f\AJ 


) 1 DC 






9J A 
**4.4 


rj cnft 


1 1 /u 






ft 


u con 


11/1 


CIO 


OA 


-25) .9 


D4,oW 


1172 


C AC 

545 


514 


-25.5 


55,700 


1174 


1099 


522 


-10.2 


55,000 


1176 


1304 


586 


-7.5 


50,200 


1177 


1366 


539 


•6.6 


53,700 


1178 


1608 


702 


-3.3 


43,400 


1179 


1485 


224 


-4.8 


124,900 


1180 


1459 


224 


-5.2 


124,900 


1181 


1431 


223 


-5.7 


125,100 


11S2 


1407 


223 


-6.1 


125,200 


1183 


1383 


224 


-6.4 


124,700 


1164 


1454 


182 


-5.3 


164,400 


1185 


1422 


183 


-5.8 


162,600 


1186 


1394 


182 


-6.3 


164,300 


1189 


1171 


214 


-9.2 


131,800 


1190 


1457 


286 


-5.2 


64,200 


1191 


686 


1114 


-19.5 


26,200 


1192 


265 


693 


<-35.0 


34,700 


1193 


403 


1292 


-32.6 


20,000 


1164 


344 


1275 


<-35.0 


20,600 


1195 


505 


1311 


-27.6 


16,400 


1196 


572 


1263 


-24.1 


20,000 


1197 


639 


1502 


-21.2 


13,000 


1198 


637 


1402 


-21.3 


16,300 


1199 


614 


1407 


-22.1 


16.200 


1200 


637 


1431 


-21.3 


15,400 


1201 


1095 


1394 


-10.3 


16,600 


1202 


1719 


1545 


-2.1 


11,600 


1203 


791 


668 


-16.5 


45,200 


1204 


964 


1021 


-12.9 


29,700 


1205 


313 


195 


<-35.0 


148,700 


1206 


306 


194 


<-35.0 


149,800 


1209 


320 


197 


<-35.0 


147,400 


1210 


326 


197 


<-35.0 


146,600 


1211 


394 


294 


-33.2 


91,400 


1212 


402 


294 


-32.7 


91,200 


1214 


386 


294 


-33.7 


61,400 


1215 


641 


329 


-21.2 


81,600 


1216 


660 


329 


-20.4 


81,600 


1217 


914 


266 


-13.8 


1 01 .800 


1210 


873 


245 


-14.7 


112,000 


1219 


970 


372 


-12.7 


72.900 


1220 


1021 


298 


-11.6 


90,100 


1221 


1392 


205 


-6.3 


139,500 


1222 


1354 


203 


-6.8 


141,800 


1223 


1362 


205 


-6.7 


139,500 


1224 


673 


540 


-19.9 


53,600 


1225 


614 


542 


-22.1 


53,400 


1226 


603 


539 


-22.6 


53,600 


1227 


696 


623 


-19.2 


47,800 


1228 


707 


628 


-18.9 


47,500 


1229 


475 


447 


-28.7 


62.300 


1230 


466 


1282 


-29.0 


20,400 


1231 


759 


1461 


-17.4 


14,400 


1232 


1324 


1170 


-7.2 


24,200 


1233 


1583 


1005 


-3.6 


30,300 


1234 


1865 


809 


^0.6 


38,200 


1235 


1812 


817 


-1.0 


37,900 


1236 


1411 


703 


•6.0 


43,400 


1237 


1392 


682 


-6.3 


44,500 


1236 


794 


410 


-16.4 


66.900 


1239 


769 


407 


-17.1 


67.300 


1240 


740 


406 


-17.9 


67.500 


1241 


743 


511 


-17.8 


55.900 


1242 


713 


510 


-18.7 


56,000 


1243 


682 


509 


-19.6 


56,100 


1244 


663 


504 


-20.3 


56.500 


1245 


565 


582 


-24.4 


50.500 



Database of rat liver proteins 927 



MSN 


X 


Y 


CPKDl 


SDSMW 


1246 


547 


577 


-25.3 


50,800 


1247 


530 


576 


-26.3 


50,900 


1249 


516 


572 


-27.0 


51.200 


1250 


973 


536 


•12.7 


53.900 


1251 


607 


532 


-22.4 


54.200 


1252 


665 


529 


-20.2 


54,400 


1253 


899 


766 


-14.1 


40,200 


1254 


1311 


746 
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Tabic 3. Computed pfs of two sets of carbamylated protein standards: Rabbit muscle CPK and human 
hemoglobin (Hb) 
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Table 4, 



Computed pfs of some known proteins related to measured CPK pfs 



*ASP #GLU *H1S #LYS #ARG Calc Real 
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Protein Name 



Creatine phespho kinase (CPK), rabbit muscle 
Fatty acid-binding protein, rat hepatic 
b2-microQlobulin, human 
Carbamoyl-phosphate synthase, rat 
Prealbumin ( serum albumin precursor), rat 
Serum albumin, rat 

Superoxid dismutase (Cu-Zn, SOD), rat 
-Phcspholipsse-C, phophoinositide-specific (?), rat 
Albumin, human 
Apo A-l lipoprotein, rat 
proApo A-l lipoprotein, human 
NADPH cytochrome P-450 reductase, rat 
Retinol binding protein, human 
Actin beta, rat 
Actin gamma, rat 
Apo A-l lipoprotein, human 
Apo A-IV lipoprotein, human 
Tubulin alpha, rat 
FlATPase beta, bovine 
Tubulin beta, pig 

Protein disulphide isomerase (PDI), rat hepatic 

Cytochrome b5, rat 

Apo C-ll lipoprotein, human 
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